ALGEBRAIC TRACE FUNCTIONS OVER THE PRIMES 



ETIENNE FOUVRY, EMMANUEL KOWALSKI, AND PHILIPPE MICHEL 



Abstract. We study sums over primes of trace functions of ^-adic sheaves. Using an extension of 
our earlier results on algebraic twist of modular forms to the case of Eisenstein series and bounds for 
Type II sums based on similar applications of the Riemann Hypothesis over finite fields, we prove 
general estimates with power-saving for such sums. We then derive various concrete applications. 
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1. Introduction 

Let f(X) = P{X)/Q(X) P,Q£ Z(X) be a non-zero rational function. If p is a prime large 
enough so that f(X) defines a rational function on F p by reduction modulo p, it follows from the 
work of Weil that we have the estimate 



E 

(Q(n),p)=l 



1/2 



(where Q{n)Q(n) = 1 (modp)) which exhibits considerable cancellation in this exponential sum. It 
is a natural question, with many potential applications, to ask whether such cancellation persists 
when the sum is restricted to prime numbers q, either less than p or over a shorter intervals (longer 
intervals being usually easier to handle). 

In [14], Fouvry and Michel proved that this is indeed almost always the case: 



Theorem 1.1. Assume that f = P/Q is not a polynomial of degree ^ 1. Then we have 



E 



q<p prime 
(Q(q),p)=l 
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for X ^ p and for any rj < 1/32, where the implicit constant depends only on the degrees of P and 
Q and on 1]. 



Similar estimates were already known when / is a polynomial (of degree > 1), but with the 
exponent 77 depending on the degree of / and tending to as the latter increased (see, e.g, [2TJ[iI]). 
Thus an important new feature in p3] was to allow for the most general possible rational fractions, 
and for a uniform p-power saving. One of the key input of the proof was an essential use of Deligne's 
theory of higher dimensional algebraic exponential sums. 

There are, however, many other functions defined over F p for which one would like to have 
similar results. For instance, with f{X) = P(X)/Q(X), one may naturally want to consider 



K{n) 



X (f(n)) if (p,Q(n)) = l 
if p\Q(n), 



for some non-trivial Dirichlet character x modulo p of order h ^ 2, provided the rational function 
/ is not an h-th power. Nevertheless, the only example we are aware of concerns the case where 
f(X) = aX+b, with b 7^ 0, is a polynomial of degree 1, which was considered first by Karatsuba [28] 
(and later |18| ) who obtained a positive power saving, which is non trivial whenever X ^ pV 2 + £ for 
some e > 0. In Corollary II. Ill we will prove a non-trivial bound for an (almost) arbitrary rational 
function /. 

Beyond additive and multiplicative characters, there are other function defined over finite fields 
which are now common tools in number theory. A nice example is given by the (normalized) hyper- 
Kloosterman sums, introduced by Deligne and studied by Katz in great detail in [29], which are 
defined by 

K(n) =Kl m (n;p) = ^^■■■E^ 1 ' + 



P 



for some integer m ^ 2. This example was first considered by the third author in [32H34] . who 

og log 1 

logp 



obtained a modest (yet non-trivial) saving of lo f^°^ p over the trivial bound 0(p/ logp) for the sum 



over primes q < p of Kl m (q;p). 

Remark 1.2. One can also wonder about the very recent generalizations of Kloosterman sums Kl^, 

associated to the general Kloosterman sheaves defined, for an arbitrary split reductive group G and 
a representation g of it, by Heinloth, Ngo and Yun [23], where the case of the hyper-Kloosterman 
sums above corresponds to G = GL n with its standard representation. It is a sign of the generality 
of our results that they do apply very straightforwardly to this case, although the corresponding 
trace functions have not (yet) been explicited for al£j] G! 

The common link between all these functions is that they are special cases of the general class of 
functions we called trace weights in |12| . with bounded conductors. Precisely, we have the following 
definition: 

Definition 1.3 (Trace weights). For a prime p and a prime £ 7^ p, an isotypic trace sheaf modulo 
p is a geometrically isotypic £-adic Fourier sheaf 3" on Ap^, in the sense of [291 Def. 8.2.2], which 
is pointwise pure of weight 0. 

An isotypic trace weight modulo p is the trace function 



1 As Ngo kindly informed us, Yun has computed these sums explicitly for G = SO(2n + 1) and its standard 
representation: Klso(3) is the symmetric square of KI2 and Klgo(2n+i) for n 2 is essentially the multiplicative 
convolution of Klso(3) an d of two Kloosterman sums Kl n , see |40] , 



for x £ F p of an isotypic trace sheaf 3~, this trace function being seen as complex-valued by means 
of some fixed isomorphism i : — > C. 

To any constructible sheaf 3" on A Fp is associated its analytic conductor, a numerical invariant 
which measures the complexity of 3". This is a positive integer defined by 

cond(J) = rank(3") + ^(1 + Swan,^)), 

X 

where x ranges over the (finitely many) singularities of 3" in P 1 (F P ), i.e., those x where 3" is not 
lisse, and Swan a; (3 r ) ^ is the Swan conductor of 'J at x (see [29] ). For an isotypic trace weight 
K(n), we define the conductor as the minimal conductor of an isotypic trace sheaf 3* with trace 
function equal to K(n) on F p . 

Remark 1.4. For example: 

• If K{n) = e(P(n)/p) for a polynomial P of degree < p, the associated sheaf has conductor 
cond(S^) = degP + 2; 

• If K(n) = x(P(n)) where x 1S multiplicative and P a polynomial, then it is bounded by 2 
plus the number of distinct zeros of P in F p ; 

• For the hyper Kloosterman sums K(n) = Kl m (n;p), the conductor is m + 3; 

• For the trace function of the £-adic Kloosterman sheaf associated to the adjoint representa- 
tion of the split reductive group G in [23], the conductor is bounded by dim(G) + 2 + r(G), 
where r(G) is the rank of G (by |23[ p. 4, (3)]: this sheaf is of dimension dim Ad = dimG, 
lisse on G m , tame at and with Swan conductor r(G) at oo). 

Our main result in this paper is an estimate for any sum over primes q of an isotypic trace 
function modulo p, which is universal in quality and gives power-saving whenever the length of the 
sum is roughly comparable with p on a logarithmic scale, in particular allowing some sums over 
shorter intervals q < p 1 ^ 9 for some > 0. The only weights we can not handle are those where the 
corresponding estimate would be tantamount to a "quasi-Riemann Hypothesis", i.e., a zero-free 
strip for some Dirichlet L-functions. 

It will therefore be natural to say that K(n) is an exceptional weight modulo p (for sums over 
primes) if it is proportional to a weight 

K xA n ) = X(n)ip(n) 

where x (resp. ip) is a multiplicative (resp. additive) character modulo p, where either or both may 
be trivial. 

Similarly, a sheaf 3~ will be called exceptional if it is geometrically isotypic and geometrically 
isomorphic to a sum of copies of a tensor product L x ®L^p of a Kummer sheaf with an Artin-Schreier 
sheaf, so that its trace function is exceptional. 

We state the results both for standard and for smoothed sums over primes. We will consider 
smooth test functions V, compactly supported in [1/2,2], such that 

(1.1) x j V {j \x) < Q j 

for some Q ^ 1 and for any integer j 0, where the implicit constant depends on j. 

Theorem 1.5 (Trace weights vs. primes). Let K be an isotypic trace weight on F p associated 
to some sheaf 3~, and assume that 3" is not exceptional. Let V be a smooth function as above 
satisfying (II. ID for some parameter Q ^ 1. 
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For any X ^ 2, we have 

(1.2) K{ q )v{^)^QX{l+p/X)^p-\ 

q prime 

(1.3) K{q)^X{l+p/X) l / l2 p-^\ 

q prime 
q^X 

for any n < 1/24. The implicit constants depend only on r\, cond(3~) and the implicit constants 
in (jl.ip . Moreover, the dependency on cond(3~) is at most polynomial. 

Remark 1.6. For X = p one gets 

£ K(q)«p^/^, 

q prime 
q<p 

and for general X these bounds are non-trivial as long as the conductor of 3" remains bounded 
and the range X is greater that p 3 / 4+e for some e > 0. Stronger results are available by different 
methods for special K. For instance, Bourgain [3] and Bourgain-Garaev [5j have obtained bounds 
for 

K{ n) = e (^+J^\ i k e N _ {0}) M = h 

which are non-trivial as long as X ^ p l / 2+£ (see also |17| for a survey of existing methods). 

Closely related to Theorem 11.51 is the following estimate: 

Theorem 1.7 (Trace weights vs. Mobius). Let /i denote the Mobius function. With the same 
notations and hypotheses as in Theorem 11.51 we have for X ^ 2 

J2»(n)K(n)V^) «QX(l+p/X) 1 /V r? , 

n 

fi(n)K(n) < X(l +p/X) l / 12 p-^\ 

for any r/ < 1/24, where the implicit constants depend only on rj, cond(3~) and the implicit constants 
in (jl.ip . and the dependency on cond(3") is at most polynomial. 

Remark 1.8. J. Bourgain pointed out that Theorem 2 of [6] (along with the Note following it) 
can be used in conjonction with (|3.2p and Proposition 13.11 of the present paper to prove that if K 
is an isotypic trace weight, we have, for any e > and for any X ^ p 1 / 2 + e 

YKn)K(n) = o(X), 

n^X 

where the implicit constant depends on cond(3~) and e. However, this approach does not seem to 
yield a power saving, and does not seem to apply if fi is replaced by A or by the characteristic 
function of the primes. 

Remark 1.9. This theorem expresses an orthogonality property (i.e., the absence of correlation) 
between the Mobius function and any isotypic trace weight modulo p with bounded conductor. 
This fits with the philosophy of the "Mobius randomness principle", formulated vaguely in |26t 
p. 338], and with Sarnak's recent precise formulation in terms of orthogonality of the Mobius 
function against function with low complexity, in the sense of entropy (see [37j). Our result is 
in a slightly different context than Sarnak's conjecture, however, since the trace weights K(n) are 
defined modulo p, and an asymptotic statement follows only by taking, for each p, a different weight 
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with some bound on the complexity, as measured by the conductor in our case. Thus, our results 
are closer in spirit to those of Green [20] and Bourgain [3] , which prove asymptotic orthogonality of 
the Mobius function against, respectively, bounded depth boolean functions, and monotone boolean 
functions on binary hypercubes {0, 1}^ identified with {1, . . . , 2^}. 

In fact, it seems to be a very intriguing question (suggested by Sarnak) to understand which 
functions can arise as small linear combinations of trace functions of low conductor (which, by 
linearity, still satisfy Theorems 11.51 and II. 7p . Another natural question is whether trace functions 
have low complexity in a more algorithmic sense, and this does not seem to be easy to answer. 
Only functions such as 

K(n) = e(^) or K{n) - ^ {n) 



p / v p 

(for P(X) G Z[X] fixed and (-) the Legendre symbol) seem to be obviously of low complexity for 
most meaning of the term, and such functions as 

K(n)=Kl 2 (n;p) 

are far from being understood in this respect. One can certainly expect the class of trace weights 
(and linear combinations with small coefficients) to be very rich and fascinating (in this respect, 
there are already hints in Deligne's conjecture about the number of trace functions with various 
conditions, see [8l[9l[13] for this topic.) 

1.1. First applications. We present here some corollaries of Theorem II. 51 which are obtained by 
applying it to specific weights K. This is only a selection and we expect many more applications. 
We leave to the reader to write down the corresponding statements involving the Mobius function, 
which follow from Theorem 11.71 

In the first result, we obtain a power saving in the sum of the error term in the prime number 
theorem in arithmetic progressions over residues classes modulo a prime which are values of some 
fixed polynomial. Precisely, we define E(X;p,a) by 

7r(X;p,a) = 6 J^l 7 r(X) + E(X;p,a), 

where 5 p (a) = if (a,p) ^ 1 and is 1 otherwise. 
We will prove in £ 15.11 : 

Corollary 1.10. Let P G Z[X] be a polynomial whose reduction modulo p is squarefree and non- 
constant. 

(1) We have 

J2 E(X;p,P(n)) « X(l + p/X) l l l2 p^ 

neF p 

for any r] < 1/48, where the implicit constant depends only on r\ and degP. 

(2) We have 

Y, E(X;p,a)^X(l+p/X) 1 / 12 p^ 

aSP(Fp) 

for any r/ < 1/48, where the implicit constant depends only on r\ and degP. 

The restriction to a squarefree polynomial could be relaxed, but some condition is needed in the 
current state of knowledge since for P = X 2 we have the interpretation 

E -E^Mf + i) 

n€F p q^X 1 1 
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in terms of average of the Legendre symbol over primes, from which we can not get power saving 
without using a quasi-Riemann Hypothesis for the corresponding L-function. 

On the other hand, if we take P = X 2 — 1, the study of either of these sums becomes equivalent 
to that of the sum 

q<X 

where q, again, runs over primes. This was estimated, as we recalled, by Karatsuba |28j . 
We can now generalize considerably this result of Karatsuba: 

Corollary 1.11 (Character sums over polynomially-shifted primes). Let f = P/Q be a rational 
function represented as a ratio of integral polynomials. Let x be a non-trivial Dirichlet character 
of prime modulus p and order h ^ 2. Assume that f modulo p is not of the form 

cX k g(X) h 

for some c S F p x ; some igZ and some g(X) G F P (X). We then have 

£ x(f(qW(q/X) « X{l+p/X) l l%-v 



q prime 



£ x(/(g))«^(i+pA) 1/1 V T?/2 



q prime 
q^X 

for any rj < 1/24, where the implicit constant depends only on rj, V and the degrees of P and Q. 

Proof. The weight K(n) = x(fi n )) is associated with the tame, rank-1, Kummer sheaf which 
has (at most) degP + degQ singularities, hence conductor bounded in terms of degP and degQ. 
It can only be exceptional if it is geometrically isomorphic to L x k ~ ^wx*) for some k ^ 0. But 
this is well-known to be precisely equivalent with f(X) = cX k g(X) h . □ 

We leave to the reader the statement of a result unifying this corollary with Theorem 11.11 for 

'g(n) 



K{n) = X (f(n))e 



P 



1.2. Kloosterman sums at prime arguments. Our last application involves the weights K(n) 
which are related to Kloosterman or hyper-Kloosterman sums Kl m . We first spell out two very 
specific corollaries for the standard Kloosterman sum in one variable: 

Corollary 1.12. For every < r/ < 1/48 there exists C(r]) such that for every p, every X ^ 2 and 
every integer n coprime with p, one has the inequalities 

£ Kh(nq;p)logq\ ^ C( V )X(1 + p/X) 1 ' 1 V 

q<X, q prime 

and 

£ Kl 2 (ny ;P ) e (^)logg| ^C(ri)X(l+p/X) l / l2 p-v. 

q<X, q prime 

These two bounds improve \27\ Lemmas 6.1, 6.2, 6.3] when c = p is a prime and when X 
is near and possibly a bit smaller than p. Using the methods of [27], one can use the second 
bound in conjunction and the Petersson formula to increase the size of the support of the Fourier 
transform $ of the test functions <3? in the problem of computing the distribution of low-lying zeros 
(average 1-level density) of the symmetric square L-functions L(sym 2 /, s) for / in the family of 
holomorphic newforms of prime level p —> +oo and weight k: with notation as in [27j (except that 
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they denote the level by N), there exists 5 > such that for any $ € S(R-) with the support of & 
in] — 1/2 — 5, 1/2 + 8[, one has 

V D(sjm 2 f^)= f ^>(x)W(Sp)(x)dx. 

p->oo \HT(p)\ ha 

The possibility of such an improvement was known to the authors of |27| (see [27\ Remark C, p. 
61]), though their method was different. 

The consideration of powers of hyper-Kloosterman sums allows us to strengthen the results (33j 
[34] concerning the existence of hyper-Kloosterman sums with large absolute value modulo a product 
of two primes: 

Corollary 1.13. For any m 2, there exists a constant a m > such that 

^A 2 (c)|Kl m (l;c)| > (a m + o m (l))XlogX, 

were A.2(c) = (/x*log 2 )(c) denotes the von Mangoldt function of order 2, which is supported over 
integers with at most two prime factors. 

This corollary shows that the normalized hyper-Kloosterman sums Kl m (l; c) whose modulus is a 
product of at most two primes have their size ~^> m 1 for a positive proportion of such moduli (when 
these are weighted by A2). In [3"3|l34]. the lower-bound was of order X, and by adding the missing 
logarithmic factor, we obtain the right order of magnitude. This answers a question of Bombieri to 
the third author from 1996. For A2 replaced by A3, a corresponding (easier) statement was proven 
in [16]. 

Another potential application of this corollary (or rather of the techniques used to prove it) is 
to reduce the value of the constant uj in the following statement, which was first established by 
Fouvry and Michel for uj = 23 in [15] subsequently improved to uj = 18 by Sivak-Fischler [381139] 
and to uj = 15 by Matomaki [31] : 

Theorem. The sequence (Ki2(l; c)) c j>i changes sign infinitely often as c varies over squarefree 
moduli with at most uj prime factors. 

1.3. Principle of the proof: the combinatorics of sums over primes. We start from a 
general perspective before explaining what features are specific to our case and what our new 
ingredients are. Thus we assume given a bounded arithmetic function K, a smooth function V, 
compactly supported in ]0, +oo[ and some X ^ 2, and we wish to obtain non-trivial bounds for the 
sum 

n 

where A denotes the von Mangoldt function. 

Using Heath-Brown's identity (see, e.g., [261 Prop. 13.3]) and a smooth partition of unity, this 
sum decomposes essentially into a linear combination of sums of the shape 

( L4 ) Yl a i( m i) • • • a k{m k ) Yl • ■ • y k( n k) 

mi,— ,m,k ni,---,n k 

T ,/mi • • • m k m ■ ■ ■ n k \ 

V y — J K (mi ■ ■ ■ m k ni ■■■n k ) 

for some integral parameter k ^ 2, where the ai(m) are essentially bounded arithmetic functions 
supported in dyadic intervals (say [Mi/2, Mj\) of short range (i.e. M- t X 1 ^), whereas the Vi(n) 

7 



are smooth functions supported in dyadic intervals with arbitrary range (say, [iVj/2, iVj] with iVj 6 
[1/2, 2X]), and where 

i 

We refer to the as the "smooth" variables and the mj as the "non-smooth" variables, as one 
is usually unable to exploit the specific shape of the functions on, except for the fact that they are 
supported in short ranges. 

Depending on which estimates and methods are available to bound these sums, according to the 
location of the point (Mi, • • • , M&, TVi, • • • , N^) in the 2fc-dimensional cube [1/2, 2X] 2k , it is useful 
to classify them into different (not necessarily disjoint) categories, based on the number of "long" 
smooth variables which are available: 

• If there is one very long smooth variable, say rti, one usually speaks of sums of type I, 
with the remaining (smooth and non-smooth variables) combined together into a single 
non-smooth variable, m, which means that the original sum (|1.4j) may be written 

E ^ E Vi( ni )v(^)K(mni). 

• If there are two relatively long smooth variables, say ni,ri2, one speaks of sums of type I2; 
after combining the remaining (smooth and non-smooth variables) into a single non-smooth 
variable, the sum can now be rewritten 

E «-EE F i( n i) y 2M^( ! ^)K(mn 1 n 2 ). 

• And if there are three relatively long smooth variables, say 711,712,^3, we will speak of sums 
of type I3, and so on. 

This classification appears more or less explicitly in the work |10j of Fouvry, in the context of 
the average distribution of primes in arithmetic progressions to large moduli. The implementation 
of this strategy depends on the possibility of dealing with the sums of type I r for r as large as 
possible, a question which becomes increasingly difficult as r increases, since the range of the 
smooth variables decreases^ All remaining sums belong then to the class of sums of type II. The 
most direct treatment of these sums -there may be other treatments available, depending on the 
original problem- consists in combining these (short) variables in subsets to form variables with 
larger ranges, in order to obtain bilinear forms involving two non-smooth variables of the type 

E E a mPnK(mn), where MN x X. 

m-^M nxiV 

One can then "smoothen" one of the variables, say n, by an application of the Cauchy-Schwarz 
inequality, leading to a quadratic form with coefficients with multiplicative correlation sums of the 
function K, namely 

E "^"ma E W (jf) K(min)K(m2n). 

mi,m2 n 

Notice here that the fact that the original variables are rather short actually helps, since it offers 
some flexibility in the ways they may be combined to tailor the relative ranges of M and N. This 
is the strategy we will follow in this paper. 



For instance, in [ID], it is shown that one could prove results on the distribution of primes in long arithmetic 
progressions on average, beyond the Bombieri- Vinogradov Theorem, if one could treat the corresponding sums of 
type I r for r — 1, . . . , 6. Currently, the sums of type I\, I2 and 73 can be handled [19]. 
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1.4. Sums of type I2. We can now come to our specific situation and explain our new results for 
sums over primes of trace weights. 

We will give estimates for sums of type /, I2 and II. In fact, the starting point of this work is a 
very general estimate for sums of type I2 (two long smooth variables of approximately equal size) 
when K is a trace weight, which follows relatively easily from the results of our earlier paper [12]. 
Indeed, using Mellin inversion, the estimation of sums of type I2 can be reduced to that of sums of 
the shape 

(1.5) §v,x(it,K) =Y J K{n)d it {n)v[^j 

n 

where t € R, and (for any u G C) we denote by 

d u (n) = d- u (n) = 2^ ^-J 

ab=n 

the twisted divisor function. 

We observe that the arithmetic function n —> dain) is (up to suitable normalization) the Fourier 
coefficient of the non-holomorphic unitary Eisenstein series 

M)=i 

for s = ^+it. The main result of our previous paper ([12, Thm 1.2]) is a universal non-trivial bound 
for the analogue of §y,x(it, K) where dn{n) is replaced with the Fourier coefficients of a classical 
cusp form (either holomorphic or not). We will extend the proof to Eisenstein series, obtaining the 
following result: 

Theorem 1.14 (Algebraic twists of Eisenstein series). Let K be an isotypic trace weight associated 
to the i-adic sheaf J modulo p. Let V be a smooth function satisfying (jl.ip with parameter Q ^ 1. 
If 3" is not geometrically trivial, then for any X ^ 1, we have 

§v,x(it,K) = Y J K(n)d tt {n)v(^) « (1 + \t\) A Qx(l + -| ) V V" 

n 

for any r] < 1/8 and some A ^ 1 possibly depending on r/. The implicit constant depends only on 
T), on the implicit constants in (jl.lh . and polynomially on the conductor of J. 

In fact, the proof of this theorem will be intertwined with the proof of the following estimate on 
sums of type 12- 

Theorem 1.15 (Type I2 sums of trace weights). Let K be an isotypic trace weight associated to 
the l-adic sheaf 3" modulo p. Let M,N,X ^ 1 be parameters with X/A ^ MN ^ X. Let U , V , 
W be smooth functions satisfying condition (jl.ip with respective parameters Qu,Qv and Qw ■ We 
then have 

m,n 

for t G R and for any rj < 1/8 and some constants A,B^1 depending on rj only. The implicit 
constant depends only on r\, on the implicit constants in (jl.ip . and polynomially on the conductor 
of?. 

Remark. (1) Through the techniques of |12| . this result depends on deep results of algebraic geom- 
etry, including Deligne's general form of the Riemann Hypothesis over finite fields, and the theory 
of the ^-adic Fourier transform of Deligne, Laumon and Katz. 

9 



(2) The Polya- Vinogradov method would yield a non trivial bound for the sum above as long 
as max(M, N) S> p 1 ^ 2 log p. Here we obtain non trivial estimates for MN S> p 3 / 4+e in particular 
when M, N >p 3 / 8+e . 

1.5. Sums of type / and II. Our second main result is a general estimate for sums of type 
//, which gives non-trivial bounds, as long as one of the variables has range slightly greater than 
p 1 / 2 logp and the other has non-trivial range. Precisely: 

Theorem 1.16. Let K be a non-exceptional trace weight modulo p associated to an isotypic l-adic 
sheaf J. Let M,N ^ 1 be parameters, and let (a m ) m , {(3 n )n be sequences supported on [M/2,2M] 
and [N/2,2N] respectively. 

(1) We have 

(1.6) £ £ a m p n K(mn) « ||a|| M(MN)^ (-I, + + pV y P ) , 

m,n " 
(m,p)=l 

where 

m n 

(2) We have 

(1.7) £ a m y> ( ™,)«(y> m |)iv( i 

(m,p)=l n^N m " 

In both estimates, the implicit constants depend only, and at most polynomially, on the conductor 
of 3". 

This theorem constitutes a significant generalization of results like |33|, Cor. 2.11] or |144 Prop. 1.3], 
which were obtained for very specific weights (additive characters of rational functions or symmet- 
ric powers of Kloosterman sums). The main difference is that we do not require any knowledge 
of the geometric monodromy group of 3". Instead, it turns out that we can build on the same 
ideas used in |12j to handle algebraic twists of cusp forms. A crucial role is played again by the 
£-adic Fourier transform, and by a geometric invariant of 3" which we introduced in [12], namely its 
Fourier- Mobius group, which controls the correlation of the trace function of the Fourier transform 
of 3~ with its pullbacks under automorphisms of the projective line. In fact, it is the intersection of 
this group with the standard Borel subgroup of upper-triangular matrices of PGL2(F p ) which we 
must understand, the essential point being that this intersection is of size bounded in terms of the 
conductor of 3" unless 3" is exceptional. This is the origin of this restriction in Theorem 11.161 It is 
rather remarkable that the upper-triangular matrices in the Fourier-Mobius group were precisely 
those which do not cause any difficulty in [12] (hence in Theorem II . 15|) . 

Remark 1.17. For the purpose of Theorem 11.51 it is indeed enough to handle sums of type I2 and 
to deal will all others as sums of type II. Other problems may require direct treatment of sums of 
type I r with r ^ 3 (see for instance the beautiful recent work of N. Pitt [36]). One might expect 
that this involves the theory of automorphic forms on GL r . 

Remark 1.18. In [14] , the first and third authors obtained bounds which could be stronger than 
(jl.6p and (|1.7p . in particular in ranges of M, N which are shorter than the Polya- Vinogradov 
range p 1 / 2 (see |14} Prop. 1.2 and Thm 1.4]). These bounds were established only for very special 
weights associated to rank one sheaves (additive characters of specific rational functions). It is quite 
conceivable that these results remain valid for more general trace weights, and we hope to come 
back to this question in a later work. From our current level of understanding at least, it seems 
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that, instead of the Fourier- Mobius group (or in addition to it), we would need to involve more 
precise information on the underlying sheaf, for example concerning fine details of its ramification 
behavior, and/or its geometric monodromy group. 

1.6. Acknowledgments. Part of this work was done during the 60th birthday conference of Roger 
Heath-Brown at Oxford. We would also like to thank the organizers, Tim Browning, David Ellwood 
and Jonathan Pila for this rich and pleasant week. Roger Heath-Brown has obtained fundamental 
results in the theory of the primes; it is no surprize that, once more, the celebrated "Heath-Brown 
identity" makes a crucial appearance in the present work. 

This paper has benefited from discussions and comments from Jean Bourgain, Satadal Ganguly, 
Paul Nelson, Richard Pink, Ngo Bao Chau, Peter Sarnak, Akshay Venkatesh, and Zhiwei Yun and 
it is a pleasure to thank them for their input. 

2. Algebraic twists of Eisenstein series and sums of type J 2 

In this section, we will prove Theorem 11.141 and Theorem 11.151 simultaneously. Indeed, the two 
results are very closely related, as we will first clarify. 

Let M, N,X ^ 1 with X/4 ^ MN ^ AX. Let t G R be given, as well as three smooth functions 
U, V, W satisfying (II. ip with respective parameters Qu, Qv, Qw- We package these parameters 
by denoting 

P = (U,V,W,M,N,X), 

and we denote 



n) \M) \N ) \X) 

m,n 

which is the sum involved in Theorem II. 151 

We start with a lemma relating the sums of type §>v,x(~, K) and Sp(-, K). 

Lemma 2.1. We adopt the above notations and for s £ C and x > 0, let 

W s (x) = W{x)x- S . 

(1) For every e > 0, there exists C = C{e), such that we have 

§ P (it,K)<. £ (Qu + Q v ) c + If iSw^xiih + i^K^dt^. 



ti\,\t 2 \<(MN) e 



(2) For every e > 0, one has 



§ vx (it,K) < e X £ max \S P (it,K)\, 

P=(Ui,Vi,V,M,N,X) 

where P runs over parameters (Ui, V±, V, M, N, X) as above with Qu 1 = Qy 1 = 1. 

Proof. (1) Denote by U and V the Mellin transforms of the smooth functions U and V. These are 
entire functions, which satisfy 

(2.D C /( S ),y( S )«(^±^) fc , 

for any k ^ 0, where the implicit constants depend on k, 9\es and the implicit constants in (jl.ip . 



We then have 



§ P (it,K) = 7 -J^ [ [ U(u)V(v)7 w (u,v)N u M v dudv 
{2my J {0) J {0) 



by Mellin inversion, where 



<J w (u,v)= K{mn)m u - U n- U - V w(^y 
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This sum can be expressed as a twist of Eisenstein series (|1.5p . namely 

7 w (u,v) = X-^§ WeitX (9 2 + it,K), 

where 

a u + v -u + v 

Thus, by a change of variable, we get 

2 f f - - /M\0* /MN\0i 

Sp( * t,K) = (2^)2 J J uih-foWh + fa)^) (— ) Sw ei ,x(02 + it,K)de 2 de 1 . 

The function Wg 1 is smooth and compactly supported on [1/2, 2]. For #i = 0, it satisfies (jl.lj) 
with parameter 

Q(0i)<Qw + N, 

where the implicit constant is absolute. 

Using (|2.ip for large enough, and the trivial bound 

§v,x(it,K) <X(logX), 

the contribution to this double integral of the region where \9\\ ^ {MN) e or 1 6*2 1 ^ (MN) £ is 

« (Qu + 

for some C depending only on e, which concludes the proof. 

(2) By a dyadic partition of unity (using Lemma f4. 31 below), and taking into account the support 
condition, we can decompose 2>y^x{i>t,K) into O(logX) sums of the shape 

§ P (it,K) 

where 

P = (U 1 ,V 1 ,V,M,N,X) 

with X/A ^ MN ^ 4X, and furthermore the functions U\, V\ satisfy condition (11. ip with parame- 
ters Qjj 1 = Qvi = 1- The result is then immediate. □ 

2.1. A simple bound. We start with the following simple "convexity" bound for the Eisenstein 
twists §v,x(it, K), which is useful for X ^ p, and will indeed imply both Theorem 11.151 and 
Theorem 11.141 for X ^ p 5 / 4 + e i 

Lemma 2.2. With the notation and assumptions of Theorem \1.14\ we have for any e > 0, 
(2.2) &v,x(it,K) « ( P QX(1 + \t\)) E (l + \t\) l / 2 Qx(^ + 

where the implicit constant depends on e and polynomially on cond(3~). 

Proof. This is relatively standard, so we will be fairly brief: the idea is to use periodicity of K 
and to represent it in terms of Dirichlet characters, reducing then to easy estimates for moments 
of Dirichlet L-functions. 

First of all, the contribution to Syx{it, K) of the integers n divisible by p is 

Y, K(0)d lt (n)vQ) ^ condm p^XlogX. 

n=0 (modp) 



Next, for (n,p) = 1, we can write 
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where x runs over the Dirichlet characters modulo p and 



*<X) = 7— W E K(m) ^ ( 



. . , , Am] 

VP 

mGF' 



is the finite-field Mellin transform of K. Thus we get 

(n,p)=l V ^ 7 X n 

The contribution of the trivial character xo to this sum is estimated by 



^ _ 1 1) i/ 2 i ^(xo) E Xo(n)dit(»)y (-^) <cond^)P 1/2 XlogX 
(indeed, since i<C is a trace weight, we have 

(p - l) 1/2 i^(X0) = J] " ^ (°) = ^ 1/2 ^(°) " #(0) «cond(J) P 1/2 , 

m£Fp 

where K denotes the unitarily normalized Fourier transform of K, so that —K is also an isotypic 
trace weight, associated to a sheaf with conductor bounded in terms of that of 3~ only, by the 
properties of the Fourier transform of ^-adic sheaves, as explained in [12, §1.4, Prop. 8.2].) 
For x non-trivial, denoting by V(s) the Mellin transform of V, we have 

E %)Ex(n)4(n)^g) = ^ j £ K(x)L(x,s + it)L( X ,s-it)V(s)X s ds, 

X+XV, n <ncs=l/2 x ^ Xo 

by a standard application of Mellin inversion and a contour shift. 
From (jl.ip . we get 

V(a) «, ' g V 



■1 + 1*1 

for all j ^ 0, where the implicit constant depends on j. Now, for any fixed e > 0, let S = Q 1+e , 
and split the s-integral into 

2i^ / " ' ~ 2i^ / "' + 2i^ / •••- / i + / 2, 

JRes=l/2 JRes=l/2 9%cs=l/2 

|3ms|^S |3ms|>5 

say. To handle I\, we apply Cauchy's inequality to obtain 

/ 2 <<{xs£|K( X )| 2 }x L s=1/2 J2\ L U,s-it)L(x,s + zt)\ 2 \ds\. 

By the Parseval identity, the first factor on the right-hand side is <C pXS. For the second factor, 
we apply the approximate functional equation to bound the product L(%, s — it)L(x, s+it) by sums 
of the shape 

X(n)d it (n) w / n \ 
^ n s VJV7 

for W rapidly decreasing and N <C p(l + S + (see, e.g., [26l Th. 5.3]). By a hybrid-large sieve 
estimate (see [351 Th. 6.4], compare with [261 Th. 7.34]), we can get the Lindelof conjecture on 
average for the integral and sum, and therefore derive 

L*=i/2 E \L(x,s-it)L( X ,s + it)\ 2 \ds\^ E (pS(l + \t\)) 1+s , 

J \3m{s)\^S XJ L X0 
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Using the rapid decay of V(s), a similar bound holds for I2, and this gives the desired result. □ 

For X ^ p 3 / 2 , for instance, the bound in this lemma is stronger than the one claimed in The- 
orem 11.141 Using Lemma 12.11 (1), we then also deduce Theorem 11.151 in this range. We will 
therefore assume for the remainder of this section that X ^ p 3 / 2 . Similarly, comparing the bounds 
of Theorems 11.141 and 11.151 with the trivial bounds 

§ v ,x {it, K) < X log X, &p(it,K)<&X log X, 

we may assume that X ^ p 3//4 . 

2.2. Spectral theory and amplification. The most important ingredient in the proof of Theo- 
rems HJ4] and HUH is the following proposition, which is proved with the methods of [12], based 
on the amplification method and Kuznetsov's formula. It is an averaged version of a bound for the 
amplified second moment of the sums §v,x(it, K). 
For t e R, L > 1 and «eC, let 

e<:2L 

I prime 

which is the amplifier (of length 2L) adapted to the Eisenstein series E(z, 1/2 + it). 
Lemma 2.3. For any e > there exists b = 6(e) ^ such that 

(2.3) / mm(\t\ 2 ,\t\- 2 - 2b )\B iT (it)§ VtX (it,K)\ 2 dt <. £ p £ (pLXQ + p 1/2 L 3 XQ(X/p + Q) 2 ) , 
provided 

LQ < p 1/4 , l^L^X. 

Proof. As in the cuspidal case in [12], we use the amplification method and the Kuznetsov formula, 
exploiting the fact that, for any given r £ R, the Eisenstein series 

(?rW £(2 ' 1/2+iT) 

occurs in the continuous spectrum of Hecke eigenforms of level p. More precisely, we have the 
Fourier expansion 

E(z, 1/2 + it) = y l l 2+a + y/2~<> 1/2 _ tf + y dudnDlnl-^Wu^Trln^einx), 

K > / t y 0(1/2 -Hi) d ( 1 / 2 + it )^ 

where 

9(s) = 7r- s r(s)C(2s), 

and 

-y/2 [Co T \ it-Ill 

(2.4) WM = t^t—tz / e-*x*~V 2 (l + -) dx 

denotes the Whittaker function (see for instance (3.29)]). 

We assume that the condition of the lemma are met. Using the notation of |121 Section 4] and 
taking there P = Xp -1 , we obtain as in |121 Prop. 4.1, (4.10)], the bound 

(2 - 5) pTi Jj-^ ^mm + w wxw' A ' )|2rft 



k>a—b 
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for any e > 0, where 2 ^ b < a are odd integers depending on e and 4>(t) = (f> a ,b(t) denotes a 
positive function such that 

4>(t) ^ b (i + |t|)- a - 2 . 

We then obtain the desired average estimate from this, using Stirling's formula and the bound 



C(l + 2tt) <C ^ + log(2 + |t| 



□ 



In order to apply this, we need some lower bound for the amplifier B^ T {it). The following lemma 
gets a suitable bound for t close enough to r. 

Lemma 2.4. For L large enough, we have 

L 



B iT {it) » 3 

losr l 



uniformly for t and r G R satisfying 



\t — t\ <C — -a — , and |t| ^ Ls . 
log 4 L' 



Proof. We observe first that for any prime i ^ 2L and \t — t\ <C log 4 L, we have 
\B iT (it) - B iT (ir)\ < ^ |^ T W-di t W| 



? prime 



= 2 ^ | cos(rlogf) - cos(tlog£)| 

^2L 
I prime 

<2|t-r| log£< 



K2L 
? prime 



log 4 L' 



and hence it suffices to prove the lower bound for t = t. 

Furthermore, we may clearly assume that r > (by parity) and that L ^ 3. We then have 

B lT ^r)= siga(d iT (e))d iT (£) = £ \d ir (£)\ = 2 ^ | cos(r log£)|, 

^2L ^2L £^2L 

£ prime £ prime i prime 

and since | cos(rlog£)| ^ 1 it is enough to prove that 



^cos(rlog^) 2 > 



(where I ranges over primes L < £ ^ 2L) under the assumption of the lemma. We do this by 
finding suitable sub-intervals where rlog£ is sufficiently far away from ir/2 modulo 7rZ. 
Consider the function 

g(x) = t logx 
for x G [L,2L]. It is non-decreasing and satisfies 

5 (2L)- 5 (L) = rlog2, g'(x) £ [^,1] for x G [L,2L]. 

In particular, if rlog2 ^ 2ir, the preimage g~ l {[— vr/4, tt/4] + 7rZ) intersects [X,2L] in 3> 1/r 
intervals of length ^ ^i. From Huxley's Theorem on primes in short intervals (see, e.g., [261 Th. 
10.4, Th. 5] and note that any of the variants of [261 Th. 10.5], going back to Hoheisel, would be 
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enough) the number of primes in any such interval is 3> T ^ gL , provided r L 5//12 £ for some fixed 
e > and L is large enough. Therefore (taking e = 1/12) we obtain 

\B iT (ir)\ » ^ 

provided 2vr/log2 ^ r ^ L 1 / 3 . 

At the other extreme, if ^ t ^ 1QQ ^ L , we have 

1 

,50 

for every L ^ £ ^ 2L, and hence Bi T {ir) S> L/logL also in that case. 
Suppose now that 

1 2vr 



cos 2 (r log ^) ^ cos 2 



100 log L log 2 

In that case, g([L,2L]) is an interval of length at least 1/(200 log L). It is then easy to see (the 
worse case is when the interval is symmetric around ir/2 + kir for some integer k) that there exists 
xo £ [L, 2L] such that 

i 

cos 2 (rlog(x )) ^ 



2 • 400 2 log 2 L ' 

Using again Huxley's Theorem, we know that the interval [L, 2L] n [xo — L 2 / 3 , xo + L 2 / 3 ] contains 
at least 3> L/logL primes, and since 

| cos 2 (rlog(^)) - cos 2 (rlog(x ))| < | log(i/a? )| « £~ 1/3 
for these primes, we have 

cos 2 (rlog(f)) » log~ 2 L, 

and therefore 

^r(«r) » 



log 3 L 

in that last case, which concludes the proof. □ 

This lemma and the average bound (|2.3|) allow us to deduce a first good upper-bound for the 
twists of Eisenstein series, averaged in rather short intervals. It will be convenient for later purposes 
to introduce the notation 

(2.6) I(r,p) = {t e~R \ \t-r\ log~ 4 p}, M(t,p) = max \& vx (it,K)\, 

tel(r,p) 

(2.7) M (Q,X)=Qx(l + ^y /2 p- 1 /\ 

so that, for instance, Theorem 11.141 claims that 

§ v<x {it,K) <.p £ (l + \t\) A M(Q,X) 

for any e > and A 1 depending on e. 
Our next result is: 

Proposition 2.5. For any r G R, we /lawe 

(2.8) / mm(\t\ 2 ,l)\§, v ,x(it,K)\ 2 dt<. £ p £ (l + \T\) B M(Q,X) 2 , 

Jl(T,p) 

for any e > and some B ^ 1 which depends only on e. 
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Proof. Let 



L 



X/p + Q 

as in [12] . If either L <C 1, or |r| > L 1 / 3 , the trivial bound 

§ v ,x(iT,K) ^XlogX 

or the convexity bound (|2.2p yield stronger results than (|2.8p (since B ^ 1). Otherwise, combining 
Lemma 12.41 with (|2.3p . we obtain 

/ min(|i| 2 , |tr 2 ^)|S y ,x(^, K)\ 2 dt « e + p^XQLf - + qY 

«^Q 2 X 2 (l + |)p- 1 /4 ; 

for some 1? depending on e, as desired. □ 

The remaining objective is to derive a pointwise bound for §v,x(it, K), and to do so we must 
relax the zero of order 2 of the weight min(|t| 2 , 1) at the origin (for similar issues with estimates of 
L-functions, see e.g. pQ; we could use similar methods, but at the expense of expressing our sums 
in terms of L-functions, and instead we use them directly, and resort to an iterative argument.) 

The basic mechanism is the following consequence of the proposition: 



Corollary 2.6. For any r € R and e > 0, with notation as above, we have 

\S v ,x(it,K)\dt < 



P £ (1 + \t\) b M(Q,X) if\r\^l, 
>I(T, P ) l p £ M(r,p) 1 /3 M (g,X) 2 / 3 i/|r|<l. 

Proof. If \t\ ^ 1, we just apply the Cauchy-Schwarz inequality to get 
§ v>x (it,K)\dt <; ' ' " ' ' 1 ' ' 

I(r,p) x Jl(r,p) ' x Jl(r,p) 



(f \S v , x (it,K)\ 2 dt) 1/2 ( [ dt) 

V/( T ,p) ' V/(r,p) ' 

< ( [ mm(\t\ 2 ,l)\S v ,x{it,K)\ 2 dt) 1/2 < P e {1 + \t\) b M(Q,X) 

K Jl(r,p) ' 



by Proposition 12.51 

Now assume |r| < 1. Let < a < 1/3 be some parameter. By Holder's inequality, we get 
S vx (it,K)\dt ^ M^p) 1 - 20 ( \§ vx (it,K)\ 2a dt 

I(t, P ) Jl(r,p) 

^ M{T,p) 1 - 2a ( [ \t\ 2 \S v , x (it,K)\ 2 dt) a ( f Itl-^/^-^dt) 1 '" 

« e , Q p e JA(r,p) x - 2a M{Q,X) 2a 
« £ , Q p s W{r,p) x - 2a M{Q,X) 2 l ? \ 

(since 2a/(l — a) < 1 and M(Q,X) ^ 1). By the trivial bound, we have 

M{T,p) l ~ 2a < M^p) 1 / 3 ^ logX) 1 - 2 "- 1 / 3 , 

and we conclude by taking a = 1/3 — e. □ 
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2.3. An iterative bound. The following lemma establishes an iterative bound for Eisenstein 
twists. 

Lemma 2.7. Assume that ft > is such that 

(2.9) §v,x(it,K) «p £ (l + |i|) A X /3 M(Q,X) 1 -' 3 
for X p 3 / 2 , any e > 0, and some A ^ 1 depending on e. Then we have 

(2.10) §v,x(^ K ) «/(l + |t|) Al ^ /?/3 M(Q,X) 1 - /3 / 3 

(2.11) S P (it, tf) < /(Qc/ + Qv) B {l + \t\) A ^X^M{Q w ,X) 1 -^ 
for A\, B 1 depending on e. 

Proof. Using Lemma 12.11 (1), we first use the assumption to estimate §p(it,K). For each t\, we 
split the integral over \tz\ ^ p £ into <C p E integrals over intervals of length log -4 p. For an interval 
I with center at r with |r| ^ 1, the integral is bounded by 

<^p £ M 1/3 M(Q w + |ti|,X) 2/3 

by Corollary 12.61 where 

M = max|§w tijX (^,-^)| <p e X /3 M(Q w + |*x|, -XT) 

by (|2.9|) applied to . Thus each such integral is 

« p £ X^ 3 M(Q w + \ti\,X) ^P/ 3 . 

For intervals centered at r with |r| ^ 1, we obtain the bound <C p e (l + |r|) j4 M(Qtv + |£i|), which 
is better, and integrating over \t\\ ?S p £ , we get (|2.11[) (note that Q >— )■ M(Q,X) is linear). 

Now, applying Lemma I2TT1 (2). we immediately deduce (|2.10p . □ 

We are now done: for X ^ p 3 / 2 , we can start applying this lemma with f3 = 1 by the trivial 
bound §v,x(it, K) <C XlogX. We deduce that, for any integer k ^ 1, we have 

Sv,x(»t,tf) < P £ (l + ItD^-'M^,^) 1 " 3 ^ 
S P (it, if) < p £ {Qu + Qv) B {l + ^X^MiQw^X) 1 - 3 '" . 

Since 

X 3 - k M(Q, X)'- 3 ' k = XQ^ (l + |) (1 ~ 3 " V V (1 ~ 3 ~ fe)/8 

^xQii+p/x^p-vy-"/ 8 , 

we therefore obtain Theorems 11.141 and 1 1 . 1 5 1 by taking k large enough. 

3. Estimating sums of type II 

In this section we prove Theorem 1 1.1 61 We will leave the proof of the simpler bound (|1.7p to the 
reader, and consider (|1.6p . proceeding along classical lines. Denoting 

T = a m (3 n K(mn) 

m,n 
(m,p)=l 

the bilinear form to estimate, we apply Cauchy's inequality and deduce that 

(3.1) |T| 2 ^ ||/3|| 2 Y, <W*m 2 Yl K( mi n)K(m 2 n). 

M/2<mi,m 2 <2M Af/2<n^2iV 
pfmim 2 
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The inner correlation coefficients are then treated by completion (i.e., by the Polya- Vinogradov 
method), which gives 

(3.2) V K{ mi n)K{m 2 n) < — \e{mi,m 2 ,0,K)\ + V minf — |C(mi, m 2 , h, K)\ 
^-^ p ^-^ \ \n \ p / 

N/2^n^2N 0<|/i|<p/2 1 1 

where 

C(mi,m2, h, K) = K(miz)K(m2z)e(^ — ^, 

a sum which satisfies the relation 

C(mi,m2, h, K) = Q(mi/m2, 1, h/rri2,K). 

For a trace weight, we have the trivial bound 

\e(mi,m2,h,K)\ ^ cond(3") 2 p, 

but this is not sharp in most cases. In fact, the crucial point is to show that for most parameters 
(mi,m2,/i), we have a better estimate with square-root cancellation. We provide such a result in 
Theorem 16.31 in Section [6l building on our earlier work in [12] . 

Proposition 3.1 (Paucity of large correlations). Let K be an irreducible trace weight modulo p 
which is not p- exceptional, associated to the sheaf 3. Then there exists C ^ 1, D ^ 0, depending 
only polynomially on cond(3~), such that 

\e(m,l,h,K)\ ^ Cp 1/2 

for every pair (m, h) £ F* x F p except for those in a set of pairs of cardinal at most D. 

After inserting (j3.2|) in ()3. 1 j) , the contribution of all triples {mi , 771,2 , h) for which 

\e(mx,m 2 ,h,K)\ ^ Cp 1/2 

is at most 

« \\<*f\\P\\ 2 (^ + Mp 1/2 logp). 

For the remaining triples, we sum over mi first. For each mi, the proposition shows that the 
possible (?ni/?Ti2, h/mi) that can occur lie, modulo p, in a finite set £ of size bounded in terms of 
the conductor of 3~ only, i.e., 777,2 modulo p and h are determined by mi up to a finite number of 
possibilities. We use the trivial bounds 

\Q(mi,m2,h,K)\ ^ cond(3~) 2 p, minf — , — ^ — , 



.\h\' p ) ^ p 

and obtain that the contribution of these terms to the right-hand side of ()3. 1 j) is 



« \w* EE E 



I f*mi 1 1 Ctm,2 I 



(t,h)e£ mi mi,m 2 

m,2=tm\ (modp) 



«ii/3ii 2 iv EE E (i« mi i 2 + i« m2 i 2 ) 



(t,h)eS. rni mi ,m2 

77i2=tmi (modp) 

«iv(i + ^)ll/3iriM 



2 11 „ M 2 



where the implicit constant depends only (polynomially) on cond(3~). 
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Combining the two, we get 

1/0/ 1 



T«\\a\\ m \(MN)^(4 ■ 1 ' " 1,OR,/2/) 



MV2 JVl/2 

where the implicit constant depends only on the conductor of 3~. This completes the proof of 
Theorem 11,161 

4. Sums over primes 
We now finally prove Theorem 11.51 our main result on sums over primes. 



4.1. Smooth sums. We start with the smooth version (ll.2p . Clearly, it is enough to estimate the 
sum 

§ V:X (A,K) = ^Mn)K(n)V^), 

n 

and we begin by recalling two lemmas. The first one is Heath-Brown's identity for the von Mangoldt 
function [22j: 

Lemma 4.1 (Heath-Brown). For any integer J ^ 1 and n < 2X , we have 

J 

A(n) 



E( _1 ) \j) E H m i)-"M m i) E logni ' 

j=l ^ ' mi,— ,mj^Z mi---rrijni---nj=n 



where Z = X 1 / 3 . 



Remark 4.2. Using instead the analogous formula 

M (n) = D-^'f" 7 ) E M("»0-/*K) E X ' 

for the Mobius function (valid under the same conditions), one proves Theorem 11.71 using exactly 
the same arguments, so we will not say more about the proof of that result. 

The second lemma provides a smooth partition of unity (see, e.g., |11| Lemma 2]). 

Lemma 4.3. There exists a sequence (V/);^o of smooth functions on [0, +oo[ such that 

- For any I, Vi is supported in ]2' _1 ,2 i+1 [; 

- For any k, I ^ 0, we have 

A w (i)< t i, 

where the implicit constant depends only on k; 

- For any x 1, 

Ew = i- 

Fix some J ^ 2. Applying these two lemmas, we see that Sy : x(A, K) decomposes into a linear 
combination, with coefficients bounded by Oj(logX), of 0(log 2J X) sums of the shape 

(4.1) S(M, AT) = ^2 E «i(™i)«2(™ 2 ) • • • uj(mj) 

•mjni • • - njj 

ni,--- ,raj 

where 
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mi,- ,mj 
X 



• M = (Mi,-- - ,Mj), N = (JVi,-- - ,Nj) are J-uples of parameters in [1/2,2X] 2J which 
satisfy 

m > iV 2 > • • • > Nj, Mi ^ X 1 /-^, M x • • • MjiVx • • • iVj xj X; 

• the arithmetic functions m \-t oti(m) are bounded and supported in [Mj/2, 2Mj]; 

• the smooth functions V^(x) are compactly supported in [iVj/2, 2iVj], and their derivatives 
satisfy 

y k V^(y)<l, 

where the implicit constants depend only on k. 

We will state different bounds for E(iVf,iV), depending on the relative sizes of the parameters, 
and then optimize the result. 

First of all, from (11.7P and integration by parts, we get the bound 



1 v 1/2 

(4.2) S(M, AT) « (pQfQXl + p 



vpV2 N\ 

for any e > 0, the implicit constant depending on e. 

Next, for J ^ 2, we obtain, by Theorem 11.151 applied to ni, n 2 and trivial summation over the 
remaining variables, the bound 

(4.3) E(M,N) « (pQ)-Qx(l + ^) 1/ V 1 /8. 
Finally from Theorem 11.161 with an integration by parts, we have the bound 

(4.4) E(M,N) « (PQTQX^ + ^ + 
for any factorization 

Mi • • • MjiVi ■■■ Nj = M x N 

where M and iV are products of some of the Mj and iV,- . 

Our goal is to choose the best of the three bounds above for each such configuration of the 
parameters (M,N). By taking logarithms (in base p), we readily see that the proof of (jl.2p is 
reduced to the optimization problem of the next section. 

4.2. An optimization problem. We consider here the following optimization problem. We are 
given a real number x > (we have in mind x = log X/ log p), an integer J ^ 3, and parameters 

(m, n) = (mi, • • • , mj,n u ■■■ ,nj) e [0, x] 2J 

such that 

(4.5) mi + nj = x, rrii^x/J, ni ^ n 2 ^ • • • ^ raj. 
We want to estimate from below the quantity 

/ \ f • /l 1\ /l °" x ~~ a 1\ 1 / 1/ / 

rj[m,n) = maxjmin , m - -J,maxmm^-, -, — -J, - - max(^0, -(1 - (nj + n 2 )) 

where a ranges over all possible sub-sums of the m; and nj for 1 ^ i,j ^ J, that is over the sums 

iea j'ea 

for J, ranging over all possible subsets of {1, • • • , J}. 
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The number r/(m, n) represent the maximal power of p that we save over the trivial bound by 
using either (|4.2p or f|4.3[) or (|4.4p . The outcome of the discussion in the previous section is that, 
for x = (log X)/ (log p) and J ^ 3, we have 

£j(M,iV) < (pQ) e QXp-^ m ' n \ 
By Heath-Brown's identity, it follows that 

§v{A,K) « (pQfQXp-v 

where 



T) = min 77(m, «.). 

(m,ra) 



We will show: 



Proposition 4.4. Let x > 3/4 be given. Provided J is large enough in terms of x, we have the 
inequality 

ii(m, n) ^ mini — , ) . 

,v ' ; ' V24' 24 J 

Combining these lower-bounds with the above estimates, the proof of Theorem 11.51 is concluded, 
noting that x ^ 1 means that X ^ p, and that 

Xp -(4x-3)/24 =x gy/ 6 ^l/24 > 

Proof of Proposition \4-4\ We have 

(4.6) 77(771, n) ^ max|maxmm^ — , — , — —j , — — max(0, —(1 — (m + ^2))) j- 

Let 5 be a parameter such that 

'Ax -3 x- 1/2 1' 



25, a; - - - 25 
'2 



(4.7) 0<<5<min v jo .. 
The interval 

Is 

is then non-empty. If we can find a subsum a such that a £ Is, we then deduce immediately that 

, . /I (T S 1 (7\ 

(4.8) r/(m,n) > maxmm^-,-,- - - - -J > <5. 

We now assume that such a subsum a does rzoi exist, and attempt to get a lower-bound on 
77(771, n) using the second term in the maximum (|4.6p . First of all, we claim that, in that case, we 
have 

(4.9) £mi<2<y, 
provided 

-j < x --- 4<5 = length^), 

a condition which we assume from now on. 

Indeed, if (|4.9p were false, using the fact that mi ^ 1/J and that 1/J is then less than the length 
of the interval Is, we would be able to find some subsum a (formed only with some mi's) which is 
contained in Is, contradicting our current assumption. 

From (|4.5p and (|4.9p . we get in particular the inequality 



nj ^ x — 25. 
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Since, under our assumption (J4.7P on 5, we have 



this implies that 



26 ^ x - - - 46 = length(I 5 ), 



rij ^ 25 



for any j ^ 3 (because otherwise, we would have 

X — — — 25 ^ 71-3 ^ Tl2 ^ Tli 

since rij ^ Is, and then, in view of (|4.7|) . we would get 

n-i + n 2 + n 3 > 3x — - — 65 ^ x, 
a contradiction). But now it follows that 
(4.10) E n i <25 ' 

because otherwise, using 45 ^ x — 1/2 — 25, we could again obtain a subsum of the n^'s, j ^ 3, in 

Combining (I4.9P and (14. lOf) . we obtain 

n\ + ri2 ^ x — 4(5 

and hence 

1 /I/ / xn\ /l 4x — 3 \ 
- - max^O, -(1 - (ni + n 2 )) J ^ mm^-, — 25 j. 

Combining this with (|4.8p . it follows that for <5 satisfying (|4.7|) and J large enough in terms of a; 
and 5, we have 

ry(m, n) min^<5, min^— , — 2^j 

For x > 3/4, we take 

. /4x-3 1 

5 = mm , — 

V 24 24, 

and Proposition 14.41 follows. □ 

4.3. Sums over intervals. We can now also easily deduce from (|1.2p the estimate (|1.3p for sums 
over primes in the interval 2 ^ q ^ X (below all sums over q are restricted to q prime) . By a dyadic 
decomposition of the interval [1,-X], we are reduced to proving that 

(4.H) Yl «.,cond CT X(l + p/X) 1 ' 1 

for X ^ 2 and for any rj < 1/24. Since the right-hand side of this bound increases with X, this is 
sufficient to conclude the proof of (jl.3p . 

Let A < 1 be some parameter and let V be a smooth function defined on [0, +oo[ such that 

supp(y) C [1 - A, 2 + A], 0<V<1, V(x) = 1 for 1 < x s: 2, 

and which satisfies 

aPV^\x) Q j , 

with Q = A' 1 . 
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By applying (|1 .2j) to V, we get 

K(g)«XA + ^K(g)F(|) 

X^q^2X q " ' 

«„,«md(<F) X(A + A- X (l 

for any rj < 1/24. 

If X > p 3 / 4 , we can take 

A = (1 + p/X) 1 / 1 2 p~^ 2 < 1 

and we obtain (j4.11|) . On the other hand, if X ^ p 3 ' 4 , the bound (|4.1ip is weaker than the trivial 
bound 2X for p large enough. 

5. Applications 

5.1. Primes represented by a polynomial modulo p. In this section we prove Corollaries II .101 
andCCIU 

For the former, we fix a non-constant polynomial P G Z[X], and we consider a prime p such that 
P is non-constant modulo p. 

For Corollary 11.101 (1), we are dealing with 

E(X;p,P(n)) = £ n(X;p,P(n)) - £ vr(X). 

nSF p nGFp nGFp 

P(n)^0 (modp) 

We denote 

iV P (x)= £ 1-1 

n£Fp 

P(n)=a; 

the "centered" number of representations of X clS cl value of P modulo p 
us to write 

E(X;p,P(n)) = £ Np(q) + £ (l - -^-|{n e F p 

neF p q^X q^X " 

(where g runs over primes, as before). 

The second term of the previous expression is trivially bounded by <C p v X + 1, since P has at 
most degP zeros modulo p. Thus Corollary 11.101 (1) follows from Theorem 11.51 and from the fact 
- recalled in Section T6.2I below - that Np is a trace function for an £-adic sheaf with no exceptional 
Jordan-Holder factor (i.e. no such factor is geometrically isomorphic to a tensor product of a 
Kummer sheaf and an Artin-Schreier sheaf). 

For Corollary 11.101 (2), we write lp(F ) for the characteristic function of the set P(F P ) of values 
of P modulo p, and we will denote P*(F p ) = P(F p ) — {0}, the set of non-zero values of P modulo 
p. A reasoning similar to the previous one leads to 

£ P(X; P ,a)=£l P(Fp) (,)-^M 7r(X ). 

aeP(Fp) q^X P 

Applying Proposition 16.71 of Section 16. 2| the first term on the right-hand side becomes 

Ci tt(X)+ £ ^KiW + Ofr^X + l) 

2^i^k q^X 
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. The formula above allows 
I P(n) + 0}|) 



where the implicit constant depends only on deg-P, using the notation of that proposition (the 
error term corresponds to the contribution of those q such that q(modp) is in one of the residue 
classes in the set S of Proposition 16.71 its size is bounded in terms of degP only.) 
Using the asymptotic formula (|6.3p for the constant ci, we get 

]T E(X;p,a)= £ ^K^ + Oip-^X), 
and Theorem 11.51 concludes the proof. 

5.2. Large Kloosterman sums with almost prime modulus. In this section we prove Corol- 
lary [LT3l It is sufficient to prove the following: 

Proposition 5.1. For any m ^ 2, and 8 such that < 5 < 1/2, there exists a constant f3 m > 
such that 

|{(P)?)> P>9 primes > X s , pq < X, \Kl m (l;pq)\ > /3 m }| > — . 

log A 

here the implicit constants depend on m and 5 only. 

We recall first the basic strategy from [33]. By the Chinese remainder theorem, we have the 
twisted multiplicativity 

(5.1) Kl m (l;pq) = K\ m (q m -p) K\ m {p m - q), 

when p and q are distinct primes. Therefore, in order to prove the existence of pairs of primes (p, q) 
for which | Kl m (l;pq)\ is large, it is sufficient to show that there exists two sets of pairs of primes 
for which \ Kl m (q m ;p)\ and | Kl m (p m ; g)| are both large, and that these two sets intersect non- 
trivially. This leads us to proving that, for pairs (p, q) in suitable ranges, the hyper-Kloosterman 
sums Kl m (q m ;p) and Kl m (p m ;g) become equidistributed in the interval [—m,m] with respect to a 
suitable measure. Such a statement is an instance of the vertical (or average) Sato- Tate laws of 
Katz and Deligne, but specialized to prime arguments. 

To state properly these equidistribution statements, we recall that for any prime number p and 
auxiliary prime t ^ p, and for any isomorphism i : <— > C, there exists a Q^-adic sheaf %£ m on 
Pp p (constructed by Deligne and studied by Katz in [29] ) such that: 

(1) The sheaf %£ m has rank m and is lisse on G mj -p p , tamely ramified at with a single 
Jordan block and wildly ramified at oo with Swan conductor 1 (in particular, we have 
cond(Kf) = m + 3); 

(2) The sheaf %t m is geometrically irreducible, and its geometric monodromy group is equal to 
G m = SL m or Sp m depending on whether m is odd or even; 

(3) The sheaf %£ m is pointwise pure of weight 0, and for any a G , the trace of the Frobenius 
at a equals 

fc(tr(BV a \X£ m )) = (-ir- 1 Kl m (a;p), 
and moreover, for any choice of maximal compact subgroup K m of G m (C), (Fr a |X£ m ) 
defines a unique conjugacy class gm(a;p) in K m with trace equal to (— l) m_1 Kl m (a;p). 
It will be easy to prove the following result using Theorem 11.51 

Theorem 5.2 (Sato- Tate equidistribution). Given 5 < 0, A ^ 1, P, Q ^ 2 such that 

the set of conjugacy classes 

{gl(r i ;p), P^Q Primes, (p,q) G [P,2P] x [Q,2Q]} C 
becomes equidistributed as P — > +oo with respect to the probability Haar measure fi m on Km- 
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Remark 5.3. A similar Sato- Tate equidistribution result over the primes holds for the generalized 
Kloosterman sheaves of Heinloth, Ngo and Yun [23] already mentioned in Remark 11.21 

We will sketch the proof below, but for the moment we can conclude from this the proof of 
Corollary 11.131 We pick a m > small enough such that 

» m ({g*eKl, |tr(0*)|^a m })>O.51, 

(such an a m exists because the direct image of the measure fi m under the trace map cp h-» | tr(<^)| 
is absolutely continuous with respect to the Lebesgue probability measure on [0,m]). 
Now let 5 > be given, let P be large enough and consider Q such that 

P 3/i+s < Q < P i/3 - 5 . 

We then have 

Q 3/4+6> ^ p ^ Q 4/3-6> 

for some 5' > depending only on 5, and we can apply Theorem 15.21 twice to show that both sets 
?i = {(P,9) G [P,2P] x [Q,2Q], v + q primes, | tr(^(g m ;p))| > a m } 
^2 = {(p,q) E [P,2P] x [Q,2Q], p / q primes, | tr(^(p m ; e?))| > a m } 

satisfy, as P — > +oo, the limit 



.({s* |tr(^)|^a m })>0.51. 



|{(p,g) G [P,2P] x [Q,2Q]}| 
In particular, the two sets have a non-empty intersection for P large enough, and in fact 

P Q 

log P log Q 

By ([53]), it follows that 

2 n , ^ P Q 



K(p, e [P 2P] x [Q,2Q], p + q primes, |Kl m (l;pg)| > q^}| > 



log P log Q 

Then we obtain by an easy argument of dyadic partition that for X large enough, we have pairs 
\{(p,q), P^q primes, , p,q^X^ 9 , pq G [JC,2X], |Kl m (l;pg)| ^ a^}| > 



log A" 
as claimed. 

Proof of Theorem \5.2\ . This is a direct application of the Weyl criterion. Let 

Xp,Q = {p + q primes, (p, qr) G [P, 2P] x [Q, 2Q]}. 
It is enough to prove that if g is a non-trivial irreducible representation of G m , we have 

(5-2) r^— tvg(gl(q m ;p))^0 

as P — > +oo. 

Now, for each p, we can interpret the sum over q as the sum of the weight 

K e (q)=tT Q (gl(q m ; P )) 

modulo p. Now we claim that, for each g ^ 1, the weight K Q is a non-exceptional irreducible trace 
weight modulo p with conductor bounded by a constant depending only on m and g. Assuming 
this, Theorem 11.51 (see 1 1 . 3 j) gives 

e t "^i(q m -,p))<<^ p p-^i+^y /12 
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for any 77 < 1/48. Dividing by |-Xj>q| X PQ/(logP)(log Q), we get 

— — £ tvg(gi(q m ;p)) « (log Q)(l + P/Q) 1/2 P^, 
1 P,Ql (P,g)ex P ,Q 

which tends to provided P 3 / 4 + 5 < Q < P A for some <5 > 0, A > 1. 
To check the claim, we first define 

X£' m = [i4 x~ m ]*^ 
so that, for a G F* , we have the trace function 

t((tr3CC)(F p ,a)) = (-l)™" 1 Kl m ( a - m ;p). 

The function 

tf e : a^tr(^(a- m ;p))) 
is then (the restriction to F* of) the irreducible trace weight associated to the sheaf g(%£') obtained 
by composing the representation %£' with the representation g. In particular, this sheaf is also lisse 
and geometrically irreducible on G m , and of rank dini£>. It is tame at oo, and its Swan conductor 
at is bounded in terms of m and dim^ only (by bounding the largest slope, see e.g. [33] ), so 
the conductor is bounded in terms of m and deg g only. Finally, because g(JC£') is irreducible of 
rank deg g 2 (we use here the fact that both SL m and Sp m have no non-trivial representations 
of dimension 1), it follows that g{%£') is not p-exceptional. □ 

6. Results from algebraic geometry 

6.1. Properties of the Fourier-Mobius group. The goal of this section is to prove Proposi- 
tion 13.11 In order to do so, we must first recall the definition of the Fourier-Mobius group of an 
isotypic sheaf 3", and establish a few of its properties which were not necessary in |12| . 

Let p be a prime. Let £ ^ p be an auxiliary prime number, i : ~ C an isomorphism. Let ip 
be the £-adic additive character such that i(ip(x)) = e(x/p) for x £ F p . 

Given any middle-extension sheaf 3" on Ap^, any finite extension k/F p and any x E P 1 (/c), we 
denote by 

(tr T)(k,x) 

the trace of the geometric Frobenius of k acting on the stalk of 3" at x. We also denote by D(3~) 
the middle-extension dual of 3* given by i*(i*3"), where j : U P 1 is the inclusion of any dense 
open set U on which 3" is lisse. 

If "J is any Fourier sheaf (in the sense of |29l Def. 8.2.2]) on Ap^, we denote by FT(3 r ) the Fourier 
transform of 3~, computed by means of ip, which satisfies 

(trFT(J))(F p ,y) = - £ (tr ^)(F p ,x)^xy) 

x£F p 

for any y £ F p . It is known that FT (3") is geometrically isotypic (resp. geometrically irreducible) 
if 3^ is. 

Let now 3~ be an isotypic trace sheaf modulo p as in Definition 11.31 In |12j . we defined the 
Fourier-Mobius group of 3" by 

G ? = { 7 £ PGL 2 (F P ) | 7 *(FT(5F)) * FT(3")}, 

where ~ denotes geometric isomorphism (see \12\ Def. 1.14]). Furthermore, we defined the corre- 
lation sums of 3~ by 

C(3~;7) = \ E (tr FT( 30)(F PI 7 • x)(tr FT(3~))(F p , x) 



V 
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for 7 E PGL 2 (F p ). 

The crucial link between these two notions is the following result (see [121 Cor. 9.2]) which 
follows from the Riemann Hypothesis over finite fields, and from bounds for the conductor of the 
Fourier transforms of Fourier sheaves. 

Proposition 6.1. Let p be a prime and let 3" be an isotypic trace sheaf modulo p. There exists 
M ^ 1, depending only polynomially on cond(3~), such that 

| t (e(ff-; 7 ))KMVp 

for all 7 i G 9 . 
Let then 

= n B, 

where B C PGL2 is the upper-triangular Borel subgroup. We deduce from the proposition above: 

Proposition 6.2. Let p be a prime, let 3~ be an isotypic trace sheaf modulo p, and let K(x) = 
i((tr 3~)(F p , x)) denote the trace function o/3~ on F p . There exists M ^ 1, depending only polyno- 
mially on cond(3~), such that 

I I ^ M^p- 

if 

(6-1) (0 J)**. 

Proof. By means of the Plancherel formula for the finite-field Fourier transform, we check easily 
that 

where 7 is the upper-triangular matrix in (|6.ip . Hence the proposition gives the result. □ 

It follows now that Proposition 13.11 is a consequence of the next theorem: 

Theorem 6.3. Let p be a prime and let 3~ be an isotypic sheaf. At least one of the following four 
properties holds: 

(1) The trace function of 3" is proportional to the trace function of a sheaf ^^^x) for some 
a E F p , i.e., to an additive character, or to a delta function at some point a E F p ; 

(2) The group Bg- has dimension ^ 1 and 3" is p- exceptional, i.e., its unique geometrically 
irreducible component is a tensor product L x <g> L v for some non-trivial Kummer sheaf L x and 
some possibly trivial additive character rj; 

(3) The group Bg- is finite and 

|B y (F p )| ^ 10cond(J) 2 ; 

(4) The conductor of 'J is at least (p/10) 1 / 2 . 

To prove this, we first prove two basic properties of the Fourier- Mobius group and one lemma 
concerning Swan conductors. 

Proposition 6.4. Let k be a finite field, and let 3^ be an i-adic isotypic trace sheaf on AjL Let S 
be its Fourier transform. Then the subgroup G$ C PGL^/c) is an algebraic subgroup defined over 
k. 

In particular, for over F p , Bgr is an algebraic subgroup o/B defined over F p . 
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We thank R. Pink for explaining to us how to prove this proposition. 



Proof. Let S C P be the divisor of singularities of 9, so that U = P — S is the largest open set 
on which it is lisse. Because S is non-constant (the sheaf 3" would have to be a Dirac delta sheaf 
supported on a single point for this to happen, and such a sheaf is not a Fourier sheaf), we have 
5^0. Let G C PGL2 be the stabilizer of S, which is a proper algebraic subgroup of PGL2 defined 
over F p . Then we have a first inclusion Gg- C G. 

Now we work over k, and just denote by U its base-change to k. We consider the action morphism 

(G x U — ► U 
^ \ (l, x) ^ 7 • X 

and the second projection P2 : G x U — > U, and we define the sheaf 

£ = M *g®p*D(g) 

on G x U and the higher direct-image J = R 2 p\\£,, which is a sheaf on the algebraic group G/k. By 
the base-change theorem for higher-direct images with compact support [7J Areata, IV, Th. 5.4], 
the stalk of J at a geometric point 7 E G(k) is naturally isomorphic to H 2 (U,^*2 &> D(S))- 

Furthermore, the constructibility theorem for higher direct images with compact support [TJ 
Areata, IV, Th. 6.2] shows that J is a constructible £-adic sheaf on G. This implies (see also [TJ 
Rapport, Prop. 2.5]) that for any d ^ 0, the set 

{7 E G(k) I dimJ 7 = dimF c 2 ([/,7*S <g> D(S)) = d} 

is constructible in G(k), i.e., is a finite union of locally-closed subsets. In particular, the set of all 
7 where 

F c 2 (C/,7*9®D(3))/0, 

is constructible. But this set is exactly Gg- by the co-invariant formula for H 2 on a curve (see |12^ Th. 
9.1]). Since it is well-known that a constructible subgroup of an algebraic group is Zariski-closed 
(see, e.g., [21 Ch. I, Prop. 1.3]) we conclude therefore that Ggr is a closed subgroup of PGL2. □ 

Next we need to understand when G5 can be "large" . We prove here a bit more than what we 
need for the sake of completeness. We use the notation T x ' y for the maximal torus in PGL2 defined 
as the pointwise stabilizer of {x, y} C P 1 (for x 7^ y) and U x for the unipotent radical of the Borel 
subgroup B a ' which is the stabilizer of 1 G P 1 . 

Proposition 6.5. Let 9" be a geometrically isotypic i-adic Fourier sheaf on Ap , with Fourier 
transform 3 = FT^,(3") with respect to some non-trivial additive character ip. 

(1) If there exists x E P 1 such that Gy D \J X , then 3 is geometrically isomorphic to a direct sum 
of copies of £-!/> (7 pn) f or some non-trivial additive character ipo, where 70 E PGL2 is such that 
70 • x = 00. In that case, we have Gg- = XJ X . 

(2) // there exist x 7^ y in P 1 such that Gy D T x ' y , then S is geometrically isomorphic to a direct 
sum of copies of ^xo(-yo(x)) f or some non-trivial multiplicative character xo, where 70 E PGL2 is 
such that 70 • x = 0, 70 • y = 00. In that case, we have Gg- = T x ' y if xo is not of order 2, and 
Gg- = N x ' y , the normalizer ofT x > y , if Xo = 1- 

Proof. (1) The "if" direction is immediate. For the converse, we may first assume that x = 00, by 
conjugation with a matrix 70 with 70 • x = 00. The assumption is then that 

Co 0'^ s < 
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for any t £ F p , where the symbol ~ denotes geometric isomorphism. Since S is geometrically 
isotypic, we also have 

(j ;)*9i«9.. 

for t G F p , where Si is the geometrically irreducible component of S- We can then apply |3CH Lemma 
2.6.13] to deduce that 

Si ^ £>tPo(X) 

(geometrically) for some additive £-adic character ipo, and hence S is a direct sum of copies of this 
Artin-Schreier sheaf. Furthermore, it follows from the classification of Artin-Schreier sheaves that 
if ipo is non-trivial and 7 ^ U°°, we do not have 7*£^ (x) — £i/> (x)> we hence the Fourier-Mobius 
group is exactly equal to U°°. 

(2) As before, we may first conjugate using some 70 to reduce to the case where x = 0, y = 00, 
and we may reduce to the case where 3~ and S are geometrically irreducible, so that the assumption 
is 

G,DT = T»- = {(<' »)} 
for all a, d G k. By [3Q|. Lemma 2.6.13], again, there exists a multiplicative character xo such that 

This character is non-trivial since S is a Fourier sheaf. Now to finish the computation of Gy, we 
use the fact that £ xo (x) is tamely ramified at and 00, and hence 

G 5 cN = N - oo = Tu{^ b M, 

the normalizer of T in PGL2. Clearly, T C G,c If 7 G N — T, on the other hand, we have 

7*3~ Xo (X) — J'xoix- 1 )' an d by the classification of Kummer sheaves, it follows that 7 G G/i x ^ {x) if 
and only if xo = Xo 5 i- e -> if Xo is of order 2. □ 

The second lemma concerns the size of Swan conductors of lisse sheaves on G m with some 
non-trivial (multiplicative) translation-invariance property. 

Lemma 6.6. Let k be an algebraic closure of a finite field of characteristic p, and let 3" be an l-adic 
sheaf for some t 7^ p which is lisse on G m ^- If there exists a / 1 in G m (k) such that 3" ~ [xa]*J, 
then m \ Swan^S"), where m is the multiplicative order of a. In particular, if 3~ is not tame at 00, 
we have Swan 00 (3") ^ m. 

Proof. Let V be the generic stalk of 3", seen as a representation of the inertia group / = I(oo) at 
00, and let 

v = ®v a 

be the decomposition of V in /-isotypic subspaces. Let W a denote the irreducible /-representation 
such that V a is a multiple of W a . 

The finite cyclic subgroup G C G m (k) of order m generated by a acts on the index set A, 
corresponding to the fact that [xa]*y = V as /-representation: we have 

[xa?]*V a = V a ;. a , 

for any integer j ^ 0, and in fact even 

[xa?]*W a = W a t. a , 

since W a is uniquely determined by V a . 
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Let B C A be one of the orbits of G. Its size \B\ is a divisor of m, and if a £ B, we have 
an isomorphism [xa' B ']*W a = W a . Since W a is irreducible, we can apply [291 Prop. 4.1.6 (2)] to 
deduce that 

Swan oc (M / ' Q ) = (modm/|S|). 
Since multiplicative translation by a is an automorphism, it follows that 

Swan 00 (M / aJ . a ) = Swan^H^) = (modm/|B|) 
for any a 3 G G. Summing over the orbit, we get 

Swanoo Vaj = (mod to), 

and then summing over the orbits we get 

Swanoo (V) = (mod to). 

If 3" is wild at infinity, than Swanoo (V) ^ 0, and therefore it must be ^ to. □ 

Having dealt with these preliminaries, we can now prove the theorem. 

Proof of Theorem \6.3[ The group B = By(F p ) is a finite subgroup of BnPGL2(F p ). We distinguish 
three situations in turn. 

(1) If B contains a non-trivial unipotent element g, then since g fixes oo, the reasoning in |12l §9, 
Proof of Th. 1.12] shows that either cond(S) ^ p, in which case the fourth case holds by [121 Prop. 
8.2 (1)], or otherwise the trace function of the Fourier transform FT^,(5F) is proportional to an 
additive character, so that the trace function of 3" is proportional to a delta function, and we are 
in the first case. 

Now, if B contains no unipotent elements, the unipotent radical of By must also be trivial 
(otherwise it would have non-trivial F p -points). So, by the structure of B, the connected component 
of By is contained in a conjugate (say D) of the diagonal subgroup in B. We continue with two 
further possibilities: 

(2) If By D D, we deduce from Proposition ^, 51 (2) that the Fourier transform of 3" is geometrically 
isomorphic to a direct sum of copies of £ X (7P0) ^ or some multiplicative character \ and some 7 GB. 
By Fourier transform, this implies that 3" is geometrically isomorphic to a direct sum of copies of 
the tensor product L x ® for some multiplicative character x an d some additive character 77. 
Here x must be non-trivial because otherwise J would not be a Fourier sheaf, and we are in the 
second case of the statement of the proposition. 

(3) Finally, if By is a finite group, its finite subgroup B C D is cyclic, and there exists xq G A 1 
such that all elements of B fix 00 and x$. Let 9 be the Fourier transform of 3. Replacing 9 with 
9o = [ — a?o]*9, which has the same conductor as 9j we can assume that xq = 0, and hence that B 
can be identified with a finite cyclic subgroup of F p acting on P 1 by multiplication. Let a G F p 
be a generator of B C F^ . There are two subcases: 

- (3.1) If 9o is not lisse on G m , there is a non-zero singularity s G G m of 9o! the geometric 
isomorphism 9o — [xa]*9o implies that the orbit of s under multiplication by powers of a is also 
contained in the set S of singularities of 9o- This set contains ^ \B\ elements, and therefore 

cond(9) = cond(9 ) > \S\ > \B\ 

in that case, and by [TH Prop. 8.2 (1)], we get 

\B\ < cond(9) < 10cond(3') 2 , 

i.e., case (3) of the theorem. 

- (3.2) If 9o is lisse on G m , we first note that 9o can not be tame at both and 00, since 
the tame fundamental group of G m is abelian and 9o would then be a Kummer sheaf, which we 
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excluded by assuming that is finite (again from Proposition 16.51 (2)). Up to applying a further 
automorphism x i— > we can assume that 9o is wildly ramified at oo. We can then apply 

Lemma [6761 to Soi an d deduce that 

Swanoo(go) > \B\, 

and hence we get again 

cond(S) = cond(So) ^ Swan^^) ^ \B\, 
and conclude as before. □ 



6.2. Decomposition of characteristic functions. In this section, we explain the necessary 
properties of the trace weights underlying Corollary 11.101 We recall especially the decomposition 
of the characteristic function of the set of values P{n) of a polynomial P G F P [X] with n G F p in 
terms of trace functions. These types of results are well-known, but we give the full proof since we 
require some quantitative information concerning this decomposition. 

Proposition 6.7. Let p be prime and let P G F p [X] be a non-constant polynomial of degree 
deg-P < p. Let 7 be the set of values of P modulo p and lp be its characteristic function. 

There exist a finite set S C F p with order at most deg P, an integer k ^ 1 and a finite number of 
trace functions Ki associated to middle- extension sheaves 9^, 1 ^ % ^ k, which are pointwise pure 
of weight 0, and algebraic numbers q G Q, such that 

(6.2) y }Tc i K i (x) = lp(x) 

i 

for all x G F p — S, and with the following properties: 

- The constants k, |c$| and cond(3~j) are bounded in terms of deg P only; 

- The sheaf 3~i is trivial and none of the 3~j for i ^ 1 are geometrically trivial, and furthermore 

(6.3) Cl = M + (p- 1 /2 )) 

p 

where the implicit constant depends only on deg P; 

- If P is squarefree, no Uj, i 7^ 1, contains an exceptional sheaf as a Jordan- Holder factor. 

Proof. Let K(x), for x G F p , denote the characteristic function of the set of values P{y) for y G F p , 
so that we are trying to express if as a linear combination of trace weights. 

Let D C A 1 be the critical points of P, S = P(D) C A 1 the set of critical values, so that P 
restricts to a finite etale covering 

V = A 1 -D^U = A 1 -S 

and let 

W V — >U 

be the Galois closure of V. The Galois group G = G&\(W/U) contains the subgroup H = 
Gal(W/V), and has order dividing deg(P)!, hence coprime to p. 

For any x G U(F P ), the Galois group G permutes the points of the fiber 7r _1 (x) C W, and 
this Galois action is isomorphic to the left-translation action on G/H. The Frobenius Fr XjP at x, 
seen as an element of G, also permutes the points of the fiber, and the subset of rational points 
ir^ 1 (x) n W(F p ) correspond bijectively to the fixed points of Fr X)P , and hence to the number of 
fixed points of Fr XjP acting on G/H, which is equal to the number of conjugates of Fr XjP that are 
in H. 
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More generally, if we consider the function 

(G — >Qe 



9 >-> 

the same argument implies that we have 



1 if g is conjugate to some h G H 
otherwise, 



K(x) = 6(Fr XjP ) 

for all x G U{F p ). 

The function 6 is invariant under G-conjugation. Hence, by character theory (since I ^ p, 
the Q^-linear representations of G can be identified with the C-linear representations) there exist 
coefficients c Q such that 

e 

where g runs over isomorphism classes of irreducible Q^-linear representations 

Q ■ G — > GL(V e ) 
of G and Xg = t r Q denotes the character of g. By composition 

A, : TTi (tf) — ► iri(U)/in(W) — G GL(V g ) 
each g determines an £-adic lisse sheaf A g on U which is pointwise pure of weight and satisfies 

X e ( F ^,p) = (trA e )(x,F p ) 
for all x G U(F p ). We therefore obtain 

K{x) = Y J O e K e 
e 

for x G U(F p ), where K g is the trace function of A g . 

We rearrange this slightly for convenience. Let 7 denote the set of g such that A g is geometrically 
trivial. We know that K e is a constant of weight 0, say a g , for g £ 7, and we define 7i = Qi, so 
K\{x) = 1, and 

ci = y^Cgpg. 
pea- 
Then we enumerate arbitrarily 

{gi7} = {g 2 ,.. .,Q k } 

and take 9~j = A ft , q = c ei . This gives the desired decomposition (|6.2p with S = S(F P ), which has 
^ |5| ^ degP elements. 

We now bound the numerical invariants in this decomposition. First, note that the number of 
non-zero summands is at most the number of g, i.e, the number of conjugacy classes in G, and 
hence is bounded in terms of deg P only. For any g we have 

1 1 geG 

which is bounded in terms of degP only (using very trivial bounds ^ dim^, \9(g)\ ^ 1 and 

the fact that the sum of squares of dim£> is equal to And since p \ |G|, all sheaves A g are 

tame, and since they are unramified outside S, we get 

cond(A £) ) ^ \S\ +dim£> 

which is again bounded in terms of deg P only. 
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Moreover, none of the sheaves A g can contain a Jordan-Holder factor geometrically isomorphic to 
£ x m (g) &ip(x) with ip non-trivial, since the are tamely ramified everywhere. If we assume that 
P is squarefree, is not a critical value, and all the sheaves A g are unramified at and therefore 
cannot have a non-trivial Kummer sheaf as (geometric) Jordan-Holder factor. Thus the sheaf 3~j 
does not contain an exceptional factor in this case. 

We conclude by proving f|6. 3j> : we have 



\7nu(F p )\ = ^ 9 ( Ft *,p) 

xeU(F p ) 

Q x£U(F p ) 

= Ci \u(f p )\+j2c s E 

80 x<=U(F p ) 

For each g which is not geometrically trivial, we can apply the Riemann Hypothesis to the inner 
sum, which shows it is <C p 1 ^ 2 with an implicit constant that depends only on degP (since the 
conductor of A e is bounded in terms of degP only). Since the number of g and the constants c g 
are also bounded in terms of deg P only, we obtain 

-^r + ° (!,1/2|c,<F ' )rl> - 

hence the result since p — degP ^ |[/(F p )| ^ p. □ 
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